Bayesian Optimization with Missing Inputs
نویسندگان
چکیده
Bayesian optimization (BO) is an efficient method for optimizing expensive black-box functions. In real-world applications, BO often faces a major problem of missing values in inputs. The inputs can happen two cases. First, the historical data training contain values. Second, when performing function evaluation (e.g., computing alloy strength heat treatment process), errors may occur thermostat stops working) leading to erroneous situation where computed at random unknown value instead suggested value. To deal with this problem, common approach just simply skips points happen. Clearly, naive cannot utilize efficiently and leads poor performance. paper, we propose novel handle We first find probability distribution each so that impute by drawing sample from its distribution. then develop new acquisition based on well-known Upper Confidence Bound (UCB) function, which considers uncertainty imputed suggesting next point evaluation. conduct comprehensive experiments both synthetic applications show usefulness our method.
منابع مشابه
Prediction with Missing Inputs
For the purposes of this paper, data mining involves: Predictive modeling (Estimating parameters is not of major interest) Nonnormal data with nonlinear relationships Large data sets This paper will not cover issues regarding small data sets, such as Bayesian predictive distributions, or tree-based models, for which specialized methods are available for handling missing data. Missing data are a...
متن کاملOptimization with missing data
Engineering optimization relies routinely on deterministic computer based design evaluations, typically comprising geometry creation, mesh generation and numerical simulation. Simple optimization routines tend to stall and require user intervention if a failure occurs at any of these stages. This motivated us to develop an optimization strategy based on surrogate modelling, which penalizes the ...
متن کاملA BAYESIAN APPROACH TO COMPUTING MISSING REGRESSOR VALUES
In this article, Lindley's measure of average information is used to measure the information contained in incomplete observations on the vector of unknown regression coefficients [9]. This measure of information may be used to compute the missing regressor values.
متن کاملNuclear norm system identification with missing inputs and outputs
We present a system identification method for problems with partially missing inputs and outputs. The method is based on a subspace formulation and uses the nuclear norm heuristic for structured low-rank matrix approximation, with the missing input and output values as the optimization variables. We also present a fast implementation of the alternating direction method of multipliers (ADMM) to ...
متن کاملBayesian Clustering with Outliers and Missing Values
The Bayesian Robust Mixture Model (BRMM) is a fully probabilistic model for grouping realvalued data into a finite number of clusters. The model is robust in the sense that it tolerates outliers in the data and handles missing values, both within the Bayesian inference framework. Foreword The purpose of this report is to provide a detailed, step-by-step derivation of the variational update equa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2021
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-030-67661-2_41